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f  a  \  Abstract 

PA  ■()<'  - 

\.  i  /  x  bLi) 

Consider  k  populations  (xj where  an  observation  from  population fKi  has  a 

^  ^ 1  f'r/  fVk", 

binomial  distribution  with  parameters  N  and  p$  (unknown).  Let  =,/max  pj.  A  popu- 

r  -  „  i‘‘ 

lation  %  with  p,-  =  p[*j  is  called  a  best  population.  We  are  interested  in  selecting  the  best 

r  (,  I*  3-  '!<<) 

population.  Let  p  =  (pi,...,pt)  and  let  a  denote  the  index  of  the  selected  population. 

/  f</4  c 

Under  the  loss  function  L(p, a)  =  f[k\  -  Pa,  this  statistical  selection  problem  is  studied 


\B±. 


.... 


via  a  parametric  empirical  Bayes  approach.  It  is  assumed  that  the  binomial  parameters 
py,  t  =  l,...,fc,  follow  some  conjugate  beta  prior  distributions  with  unknown  hyperpa¬ 
rameters.  Under  the  binomial-beta  statistical  framework,  an  empirical  Bayes  selection  rule 
is  proposed.  It  is  shown  that  the  Bayes  risk  of  the  proposed  empirical  Bayes  selection  rule 
converges  to  the  corresponding  minimum  Bayes  risk  with  rates  of  convergence  at  least  of 
order  0(exp(— cn))  for  some  positive  constant  c,  where  n  is  the  number  of  accumulated 
past  experience  (observations)  at  hand.  ,  (r  ^ 
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1.  Introduction 


Consider  k  populations  JTi, .. .  where  an  observation  from  jt,  has  a  binomial  dis¬ 
tribution  with  parameters  N  and  p,  (unknown).  Let  pjj)  <  ...  <  p[*j  denote  the  ordered 
values  of  the  parameters  pi, . .  ■ , Pfc.  It  is  assumed  that  the  exact  pairing  between  the 
ordered  and  the  unordered  parameters  is  unknown.  Any  population  associated  with  pj*) 
is  considered  as  the  best  population.  A  number  of  statistical  procedures  based  on  single 
sampling  or  sequential  sampling  rules  have  been  studied  in  the  literature  for  selecting  the 
best  binomial  population.  Sobel  and  Huyett  (1957)  have  studied  a  fixed  sample  procedure 
through  indifference  zone  approach.  Gupta  and  Sobel  (1960),  Gupta  and  Huang  (1976), 
and  Gupta,  Huang  and  Huang  (1976)  have  studied  this  selection  problem  using  a  subset 
selection  approach.  Bechhofer  and  Kulkarni  (1982)  and  Kulkarni  and  Jennison  (1986)  have 
studied  a  sequential  selection  procedure  (also,  see  Gupta  and  McDonald  (1986)  for  some 
new  work  and  an  application). 

Now,  consider  a  situation  in  which  one  will  be  repeatedly  dealing  with  the  same  selec¬ 
tion  problem  independently.  In  such  instances,  it  is  reasonable  to  formulate  the  component 
problem  in  the  sequence  as  a  Bayes  decision  problem  with  respect  to  an  unknown  prior  dis¬ 
tribution  on  the  parameter  space,  and  then,  use  the  accumulated  observations  to  improve 
the  decision  rule  at  each  stage.  This  is  the  empirical  Bayes  approach  of  Robbins  (1956, 
1964  and  1983).  Recently,  Gupta  and  Liang  (1987)  have  studied  the  problem  of  selecting 
the  best  binomial  population  by  using  the  nonparametric  empirical  Bayes  approach.  They 
assume  that  the  form  of  the  prior  distribution  is  completely  unknown.  However,  in  many 
cases,  an  experimenter  may  have  some  prior  information  about  the  parameters  of  interest, 
and  he  would  like  to  use  this  information  to  make  appropriate  decisions.  Usually,  it  is 


suggested  that  the  prior  information  is  quantified  through  a  class  of  subjectively  plausible 
priors.  In  view  of  this  situation,  in  this  paper,  it  is  assumed  that  the  binomial  parameters 


Pi,  *  =  I follow  some  conjugate  beta  prior  distributions  with  unknown  hyperpa¬ 
rameters.  Under  the  binomial-beta  statistical  framework,  an  empirical  Bayes  selection 
rule  is  proposed.  It  is  shown  that  the  proposed  empirical  Bayes  selection  rules  possess  the 
following  asymptotic  optimality  property:  The  Bayes  risk  of  the  proposed  empirical  Bayes 
selection  rules  converges  to  the  minimum  Bayes  risk  with  rate  of  convergence  at  least  of 
order  0(exp(— cn))  for  some  positive  constant  c,  where  n  is  the  number  of  accumulated 
past  experience  (observations)  at  hand. 


2.  A  Bayesian  Formulation  of  the  Selection  Problem 

Let  it i, . . . ,  x*  denote  k  populations,  each  consisting  of  N  trials.  For  each  i  =  1  ,...,&, 

let  pi  be  the  probability  of  success  for  each  independent  trial  in  x,-,  and  let  X,  denote 

the  number  of  successes  among  the  associated  N  trials.  Then,  conditional  on  p,,  Xj  is 

binomially  distributed  with  probability  function  /,(x,jp,)  =  (£)p*'(  1  —  Pt)n-X*»  x,  = 

k 

0,1,..., N.  Let  /(x|p)  =  f[  /,(x,|p,),  where  x  -  (xi,...,ifc)  and  p  =  (pi,...,pfc).  For 

t=i 

each  p,  let  pj,-j  <  . . .  <  pj/tj  be  the  ordered  values  of  the  parameters  Pi , . . . ,  p*.  It  is  assumed 

that  the  exact  pairing  between  the  ordered  and  the  unordered  parameters  is  unknown.  A 

population  x,-  with  p<  =  p[*.j  is  considered  as  a  best  population.  Our  goal  is  to  derive  an 

empirical  Bayes  rule  to  select  the  best  binomial  population. 

Let  n  =  {p|p  =  (pi,...,Pfc),  Pi  €  (0,1),  *  =  1  ,...,£}  be  the  parameter  space.  It  is 

assumed  that  the  parameter  p  has  a  prior  distribution  G  with  a  joint  probability  density 
k 

function  g( p)  =  ff»(Pi)»  where  for  each  t  =  1, . . . ,  k, 

»= l 


9i{pi)  = 


r  (*.) 


r(a,/x,)r(a,(i  -  pi)) 


pr"~I(i-Pir(i~*)'1. 


2 


(2.1) 


and  where  0  <  m  <  l,a<  >  0,  both  m  and  a,  are  unknown.  Thus,  we  call  the  statistical 


model  under  study  as  a  binomial-beta  model. 

Let  A  =  {«|t  =  1, ...,1b}  be  the  action  space.  When  action  i  is  taken,  it  means  that 
population  is  selected  as  a  best  population.  For  the  parameter  p  and  action  t,  the  loss 
function  L(p,  t)  is  defined  as 

£(2,0  =  P[fc]  ~Pi, 


the  difference  between  the  best  and  the  selected  population. 

Let  X  =  (Xi, . . . , Xk)  and  let  X  be  the  sample  space  generated  by  X.  A  selection 

rule  d  =  (dx,...,djb)  is  a  mapping  from  the  sample  space  X  to  [0,1]*  such  that  for  each 

observation  z  =  (zx, . . .  ,1*)  in  X,  the  function  d(x)  =  (di(z), . . .  ,  dfc(z))  satisfies  that 

k 

0  <  <U(x)  <  1,  i  =  l,...,fc,  and  £  d,(z)  =  1.  Note  that  d,(z),  i  =  1, . . . ,fc,  is  the 

*=1 

probability  of  selecting  the  population  tt,  as  the  best  population  when  z  is  observed. 

Let  D  be  the  class  of  all  selection  rules  defined  above.  For  each  d  £  D,  let  r(G,  d) 
denote  the  associated  Bayes  risk.  Then,  r(G)  =  jnf^  r(G,d)  is  the  minimum  Bayes  risk. 
From  (2.1)  and  (2.2),  the  Bayes  risk  associated  with  the  selection  rule  d  is: 


(G,d)  =  f  ^  L(2,<*(5))/(?l2)<*G(p) 

Jn  zer 


«*-E 

xer 


J2di{z)vi(zi) 


L»=l 


/(?)> 


(2.3) 


where  /(z)  =  \\  /,-(*<),  /,(zf)  =  /J  /i(x,|p)g,(p)dp,  =  ^[p»|*i]  =  (x,  +  a,/it)/(N  + 

i=i 


o,),  the  posterior  mean  of  pi  given  X,  =  x„  C  =  £  /n  P[*]p(p|x)dg/(z),  being  a  constant, 

zer 

and  ff(pjz)  is  the  posterior  joint  density  function  of  p  given  X  =  x. 

3 


For  each  x  €  X ,  let 

j  /  \  f  .|®t  t  ®iMi  ®»‘  "t"  ®t Afj  i  ,  > 

'  #  -  1  N  +  a,  i<j<fc  N  +  otj  1  v  ’ 

Thus,  &  randomized  Bayes  selection  rule,  say  da  =  (diG»  •  •  •  ,^*g),  can  be  obtained  as 


follows: 


d,c(x)  =  ( lAte)l  1  iUe 

( 0  otherwise. 


where  |A(x)|  denotes  the  cardinality  of  the  set  A(x). 

Note  that  the  Bayes  selection  rule  dc  is  dependent  of  the  values  of  the  parameters 
(a,-,/ii),  i  =  1, ...,fc.  However,  since  the  values  of  these  parameters  are  unknown,  it  is 
impossible  to  apply  the  Bayes  selection  rule  da  for  the  selection  problem  at  hand.  As  we 
mentioned  above,  we  study  this  selection  problem  via  empirical  Bayes  approach. 

3.  The  Proposed  Empirical  Bayes  Selection  Rule 

For  each  i  =  1,. . .  ,k,  at  stage  j,  consider  N  independent  trials  from  population  x,  . 

Let  Xu  stand  for  the  number  of  successes  among  the  N  trials.  Let  p,y  stand  for  the 

probability  of  a  success  for  each  of  the  N  trials.  Let  Pj  =  (Piy, . . . ,  P*y).  We  assume  that 

k 

Pj>  j  =  1,2, ...  are  iid  with  a  prior  density  g(j>)  —  <7t(p»)>  where  »  =  1, . . . ,  k,  are 

i=l 

given  by  (2.1).  Conditional  on  P,y  =  pij,  A'iyjpiy  ~  B[N,pij).  Let  Xj  =  (Xiy, . . . ,  Xkj) 
denote  the  random  observations  at  the  jth  stage,  j  =  1, . . . , n.  We  also  let  Xn+i  =  X  = 
(Xj, . . . ,  Xk)  denote  the  random  observation  at  the  present  stage. 

Under  the  binomial-beta  statistical  model,  we  have,  for  each  »  =  1  ,...,£, 


r  E\Xi/N]  =  Hi 

l  £((Xi/iV)2]  =  m/N  +  (a,M,  +  1  )m{N  -  1  )/(N{cti  +  1))  =  m,2  (say). 


I 


4 


inwrj*.  r%F  rw  vuwiwwfMW  mi  lrjTPUwv^. 


From  (3.1),  through  direct  computation,  the  parameter  a,  can  be  written  as  o,  =  Bi/Ai , 


where 


(  Bi  =  m  -  n%2 

\  =  Hi2  ~  H%N~X  +  Hi  N~x  -  $• 


Note  that  under  the  binomial-beta  model,  0  <  ( Xi/N )2  <  Xi/N  <  1  and  therefore  B,  >  0 
since  Xi  is  a  non-degenerate  random  variable.  Also,  A,  >  0  since  a,  >  0.  Thus,  Hi  and 
Hi2  satisfy  the  following  inequalities:  m»N_1  ~h\N~x  +  h*  <  H%2  <  Hi-  From  (3.2),  a,  can 
be  viewed  as  a  function  of  Hi  and  Hi2  for  Hi  £  (0,  1)  and  Hi2  €  {HiN~x  —  Hi  N~x  +  Hit  Hi)- 
For  each  fixed  Hit  <*»  is  decreasing  in  Hi2  and  lim  a*  =  0,  lim  a,-  =  oo,  where 

Mia- ‘Mi  Mia-*<»i 

a,  =  HiN~x  -  HiN _1  +  Hi- 

Let  Hin  and  Hi2n  be  the  moment  estimators  of  Hi  and  Hi2t  respectively,  based  on  the 
n  past  observations  at  hand.  That  is, 


m  .  =  ;E  X„/N, 

j=i 

*».  =  it  (Xi,/N)\ 


Also,  let 


f  Ain  —  Hi2n  ~  Hi 
l  Bin  =  Hin  ~  Hi2 


-  HinN-'  +  rfN-'  -  «?„, 


Then,  we  propose  some  empirical  Bayes  estimators  for  the  unknown  parameter  a,  and  the 
posterior  mean  <Pi(xi)  =  (x*  -I-  a,Mi)/(N  +  o^)  as  follows: 


0*n  —  | 

<Pin{Xi)  =  |  ^Z*  + 
l  Min 


Bin! Ain  if  Ain  ^  0, 

oo  otherwise; 


+  ClinHin) / {N  +  a,n)  if  ain  <  oo, 

if  a^n  =  oo. 


We  then  propose  an  empirical  Bayes  selection  rule  d*  =  (djn, ...» d*kn)  for  the  selection 
problem  under  study,  as  follows: 


VTV  ''V  vv 


For  each  x  €  X ,  let 

^n(?)  =  {‘>m(l.)  =  maX^/nfo)}  (3.7) 

and  for  each  i  =  1, . . . ,  k,  let 

dt*_(x)  =  {  l^n(?)l  1  e  ^n(?)>  (3.8) 

\  0  otherwise. 

We  denote  the  associated  Bayes  risk  of  the  proposed  empirical  Bayes  selection  rule  d*  by 
r(G,d*).  Then,  from  (2.3), 

r  k 

r(G,i'n)  =  C-  Y,  /(*)•  (3.9) 

xer  L.=i 

Remarks 

1.  Note  that  a,  =  oo  ==>  Var(P,)  =  0,  which  means  that  the  prior  density  ff,(p,)  is 
degenerate  at  the  point  pi  =  p,.  In  this  situation,  the  posterior  mean  <Pi(xi)  —  m- 
Hence,  it  is  reasonable  to  estimate  y?,(x«)  by  /x,n  when  am  =  oo.  We  consider  the 
case  where  a,-  =  oo  as  an  extreme  case  for  the  family  of  beta  distributions. 

2.  Definition  3.1.  A  selection  rule  d  =  (d\,...,dk)  is  said  to  be  monotone  if  for  each 
i  =  1  ,...,fc,  d{(x)  is  nondecreasing  in  x,  while  all  the  other  variables  xy  are  kept 
fixed,  and  nonincreasing  in  x}  for  each  j  ±  i  while  all  the  other  variables  are  kept 
fixed. 

For  the  fixed  past  observations  Xi,...,Xn,  we  see  from  (3.6)  that  for  each  »  = 
1  ,...,fc,  <fiin{xi),  the  estimator  of  the  posterior  mean  £>,(x,),  is  increasing  in  x,.  Thus, 
from  (3.7)  and  (3.8),  one  can  see  that  the  proposed  empirical  Bayes  selection  rule  d* 
possesses  the  monotone  property. 


4.  Asymptotic  Optimality  of  the  Selection  Rules  {d* } 

Consider  an  empirical  Bayes  selection  rule  dn  =  ( d\n , .  .  ,,dfcn).  Let  r(G,  dn)  be  the 
associated  Bayes  risk.  Then,  r(G,  dn)  —  r(G)  >  0  since  r(G)  is  the  minimum  Bayes  risk. 
Thus  E[r(G,dn)}  —  r(G)  >  0,  where 

f  k 

E{r(G,dn)]  =  C-J^  /(?)  (4-1) 

xex  L«=i 

and  the  expectation  E[din{x)\  is  taken  with  respect  to  (Xi,. . . ,  Xn).  The  nonnegative 
difference  E[r(G,  dn)]  —  r(G)  is  always  used  as  a  measure  of  performance  of  the  selection 
rule  dn. 

Definition  4.1.  A  sequence  of  empirical  Bayes  rules  {dn}£Lj  is  said  to  be  asymptotically 
optimal  at  least  of  order  /?„  relative  to  the  unknown  prior  distribution  G  if  E\r(G,dn)\  - 
r(G)  <  0(/3„)  as  n  — ►  oo,  where  {/?n}  is  a  sequence  of  positive  values  such  that  lim  0n  =  0. 

n — »oo 

In  order  to  investigate  the  asymptotic  optimality  of  the  empirical  Bayes  selection  rules 
{d* },  we  need  the  following  lemmas. 

Lemma  4.1.  If  random  variables  Yi,...,Y„  are  iid  such  that  a  <  Yi  <  b,  i  =  1, . . . ,  fc, 
then  for  each  t  >  0, 

P{Y  —  n  >  t}  <  exp{-2nt2/(6  —  a)2}, 

where  Y  =  L  £  Yj ,  and  /x  =  E\Y]. 
y=i 

Proof:  This  lemma  is  a  special  case  of  Theorem  1  of  Hoeffding  (1963). 

Lemma  4.2.  Let  /x*2»  Min  and  /x,2n  be  as  defined  in  (3.1)  and  (3.3),  respectively. 
Then,  for  any  c  >  0, 
a)  P {M»n  -  Mi  <  -e}  <  0(exp(-2nc2)), 


rjv-.-'rfuwKyv.  jnunufxxnufxjrw>  v/x-z.c.v.v/a.'vv. 


b)  jP{Mtn  -  Hi  >  c)  <  0(exp(~  2 nc2)), 

c)  P{t*i2n  ~  Hi2  <  -c}  <  0(exp(— 2nc2))  and 


d)  P{»x2n  ~  Mi2  >  c}  <  0(exp(-2nc2)). 

Proof:  Note  that  under  the  framework  of  statistical  model  under  consideration,  Xij/N , 

j  =  1  are  iid  and  0  <  Xij/N  <  1.  Then,  0  <  ^  Xij/N  <  1.  Thus, 

j  =  i 

P{fJ-in  —  Hi  >  c}  =  0  if  /x,  +  c  >  1,  and  P{/xln  ~  Mi  >  c}  <  exp{— 2nc2}  if  jx,  +  c  <  1, 
which  follows  from  Lemma  4.1.  This  completes  the  proof  of  part  b). 

The  proof  for  the  other  inequalities  are  analogous  and  hence  omitted. 

Lemma  4.3.  Let  Ai,  Bi,  Ain  and  Bin  be  as  given  in  (3.2)  and  (3.4),  respectively. 
Then,  for  any  c  >  0,  we  have 

a)  P{Ain  —  Ai  <  —  c}  <  0(exp(— nc2/8)), 

b)  P{Ain  -  Ai  >  c}  <  0(exp(-nc2/8)), 

c)  P{Bin  —  Bi  <  — c}  <  0(exp(— nc2/2)),  and 

d)  P{Bir  —  Bi  >  c}  <  0(exp(— nc2/2)). 

Proof:  The  techniques  used  to  prove  these  four  inequalities  are  similar.  Here,  we  give  the 
proof  of  part  a)  only. 


Since  0  <  /z*«  <  1,  0  <  Hi  <  1,  and  N  is  a  positive  integer,  then  0  >  -  l)(M»n  + 

Hi)  -  jr  >  {j[  -  1)2  -  jr  =  L=SE-  Therefore, 

-  !)(/*<» + «)  -  - «)  <  -  j> 

<  p{-  w2^(w»-w)  <  -|} 

-  P{Hin  -  Hi)  >  2(2iV  _  } 

<  -P{M»n  -  M,  >  j}- 

Thus, 

P{A,n  -  A,  <  c} 

<  P{Mi2n  ~  Hx2  <  -“>  +  ~  Hi  > 

<  P {.Hi2n  ~  Hi2  —  —  P{Hin  ~  Hi  ^ 

<  0(exp(-nc2/8)), 
which  follows  from  Lemma  4.2. 

For  each  iG  I,  let  A(x)  be  as  defined  in  (2.4),  and  let  B(x)  =  {1,2, . . . ,  fc}\.A(x). 
That  is,  B(x)  is  the  set  consisting  of  the  indices  of  nonbest  populations  given  X  =  x.  Thus, 
for  each  iGl.iG  A{x),  j  €  B(x),  <pi{xi)  >  <Pj(xj).  From  (2.4)  and  (4.1),  following 
straightforward  computation  and  using  the  fact  that  0  <  y?,(x,)  <  1,  0  <  /(x)  <  1,  we  see 
that  for  the  empirical  Bayes  selection  rule 

0  <E[r(G,d*n))-r(G) 

<  X]  E  E  P&iniXi)  <  <Pjn{Xj)}.  (4.2) 

xer  i€A(X)}€B(X) 


Since  the  sample  space  X  is  finite  and  for  each  x  €  X ,  \A(x)\  -f-  |B(x)|  =  k,  therefore, 
it  suffices  to  evaluate  the  asymptotic  behavior  of  the  probability  P{<Pin{xi)  <  IPj  »(*>)) 


where  *  e  A(x),  j  e  B(x).  Now,  for  each  x  G  X,  i  e  A(x),  j  €  f?(x), 

P{<Pin{Xi)  <  <Pjn(x})) 

=  P{<Pin{xi)  <  <Pjn(xj)  and  (a,„  <  oo  and  a :y„  <  oo)}  (4.3) 

+  P{<Pin{xi)  <  <pjn{xj)  and  (a»n  =  OO  or  ayn  =  oo)}. 

Let 

a  =  min{A,|t  =  1,...,*}  (4.4) 

where  Ai,  i  =  1, . . . ,  fc,  are  defined  in  (3.2) .  Then,  a  >  0  since  A,  >  0  for  all  *  =  1, . . . ,  k, 
and  A;  is  a  finite  number. 

Lemma  4.4.  For  each  x  G  X,  t  G  A(x),  j  G  £(x), 

P{<Pin{xi)  <  <P]n(xj)  and  (a,„  =  oo,  or  ayn  =  oo)}  <  0(exp(-na2/8)). 

Proof:  Note  that 

P{<Pin{xi)  <  <pjn(xj)  and  (a,n  =  oo  or  ajn  =  oo)} 

<  P{ain  =  oo}  +  P{ajn  =  oo} 

=  P{Ain  <  0}  +  P{Ajn  <  0} 

=  P{Ain  -  Ai  <  -Ai}  +  P{A}n  -  Ay  <  -Ay} 

<  P{Ain  -  Ai  <  -a}  +  P { Ay n  -  Ay  <  -a} 

<  0(exp(-na2/8)), 
which  is  obtained  from  Lemma  4.3. 

For  each  *  =  1,...,*,  and  n  =  1,2,...,  let  C,(x,)  =  x^A*  +  B,m»,  A  =  NA,  + 
B»,  f/m(x*)  =  x»Ajn  +  BinHin  and  Din  =  N  A»n  "h  Sin-  Also,  let 

b  =  m\n{Ci(xi)Dj  -  Cy(xy)D,|*  €  A(x),  j  G  B(x)}.  (4.5) 
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Then,  6  >  0  which  is  a  consequence  of  the  definitions  of  the  sets  A{x)  and  B(x)  and  the 
fact  that  the  sample  space  X  is  a  finite  space.  Thus,  for  i  G  A(x),  j  G  B(x), 


P{<Pin{xi)  <  <Pjn{Z])  and  (ain  <  oo  and  ajn  <  oo)} 

=  P{Cin{xi)Djn  —  Cjn(xj)Din  <  0} 

<  P{[Cin(xi)Djn  -  Cjn(xj)Din\  -  {Ci(xi)Dj  -  Cj{xj)Di\  <  -b} 

<  P{Cin(xi)Djn  -  Ci(xi)Dj  <  -|}  +  P{Cjn{xj)Din  -  Cj[xj)Di  >  b-}. 


Now, 


P{Cin{xi)Djn  -  Ci(xi)Dj  < 

=  P{\Cin{*i)DJm  -  Cin(Xi)Dj ]  +  \Cin(xi)Dj  -  CiixjDj]  < -b-} 
<  P{Cin(Xi)[Djn  -  Dj)  <  -j)  +  P{[C<n(x<)  -  Ci{xi)\Dj  <  -  j}. 

4  4 


Similarly, 


P{C}n{xj)Din  Cj{lj)Di  —  2^ 

—  P{Cjn{xj){Din  —  D{)  >  — }  +  P{\Cjn(xj)  ~  Cj{.xj)\Di  ^  j}* 


Lemma  4.5. 

a)  P{Cin(Xi)[Djn  -  Dj)  <  <  0(exp(-6(n))), 

b)  P{Cin{xi)\Djn  -  Dj)  >  J}<  0(exp(-6(n))), 

c)  P{\Cin{xi)  ~  Ci{xi))Dj  <  -£}  <  0{exp(-6(n))),  and 

d)  P{[Ctn(ij)  -  Ci(xi))Dj  >  |}  <  0(exp(— 6(n))),  where  b{n)  =  n62/(512JV2(JV  +  l)2). 
Proof:  We  prove  part  a)  and  c)  only.  Proofs  for  b)  and  d)  are  similar. 


S: 

$ 

y 


;s 


i , i .1  *>.1  (j'Lil t**^**!  »%4  >*1  >«»  g  i 


r\F  rv*JWU*  ruf  njc  rurruinjr  * 


P{C,„W[C,„  -i>,|<-  j} 

=  P{(x,A,n  +  Bi„lii„)(N Ain  +  By„  -  JVAy  -  Bj)  <  -  j} 

<  P{(N  +  l)[N(Ay„  -  Ay)  +  (By,  -  By)]  <  -  j} 

(since  0  <  Z<A*n  +  <  #  +  1) 

<  P{Ayn  -  Ay  <  -8jv(N  +  l)}  +  -  Bi  <  "gpVTl) 

,  n62  .. 

S  0(eXp('5l2ivf5(FW))' 


which  follows  from  Lemma  4.3. 


where 


P{[Ci„(Xi)  -  C<(*()]Dy  <  -  j> 

<  P{C,„(x,)  -  Ci(xj)  <  --^~——}(mnce  0  <  Dy  <  N  +  1) 

—  P{xi(Ain  —  Ai)  +  Hi{Bin  —  Bi)  <  ~  ) 

<  P{Xi(Ain  -  Ai)  <  ~S(N+  1J>  +  PMBin  ~  Bi  <  ~S(N+  j)} 


PMBi*  B^~  s(N  +  1)  ^ 

<  P{Bin  -  Bi  <  ~8(Ar6+7y}(since  0  <  **»  <  l) 

nb 2  .. 
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PMAir.  -  At)  <  -gjjy^Yy)  =  0  if  Xj  =  0 


and  for  x,  >  0, 


P{*.(A~  -  *)  <  -i^} 

<  P{Ain  -  A i  <  -SN{N+1)} 

.  nb 3  .. 

“  bl2N2(N  +  l)2  " 

Thus, 

P{[Cin{xi)  -  Ci{xi))Dj  <  -£}  <  0(exp(-6(n))). 

Therefore,  from  (4.6)  to  (4.8)  and  Lemma  4.5,  we  conclude  that:  For  i  €  A(x),  j  6 

*(*). 

P{<Pin{xi)  <  <Pjn(zj)  and  (atn  <  oo  and  ajn  <  oo)}  <  0(exp(-5(n))),  (4.9) 

where  the  expression  at  the  right-hand-side  of  (4.9)  is  independent  of  the  present  observa¬ 
tion  x. 

Now,  by  the  finiteness  of  the  sample  space  X  and  from  (4.2),  (4.9)  and  Lemma  4.4, 
we  conclude  the  following  theorem: 

Theorem  4.1.  Let  {d* }  be  the  sequence  of  empirical  Bayes  selection  rules  defined 
in  Section  3.  Then, 

£[r(G,d;)]  -  r(G)  <  0(exp(-cn)), 

where  c  =  min(5iaJVa>(jv+i)a ,  t)  >  0  and  °  111(1  b  are  defined  in  (4.4)  and  (4.5),  respec¬ 
tively. 

References 

Bechhofer,  R.  E.  and  Kulkami,  R.  V.  (1982).  Closed  adaptive  sequential  procedures  for 
selecting  the  best  of  k  >  2  Bernoulli  populations.  Statistical  Decision  Theory  and 

13 


muhi  wiiu»ui»iuiu»u»u»iwwijui»uw/w  jw  rjuuuwrjuui  wmr*  v*  vn  vy  wwuv  wvwwTJfl  '/^/rvTVT^TQTOngnwwowwrjwipjjnnraxia 


Related  Topics-III  (Eds.  S.  S.  Gupta  and  J.  O.  Berger),  New  York,  Academic  Press, 
Vol.  I,  61-108. 

Gupta,  S.  S.  and  Huang,  D.  Y.  (1976).  Subset  selection  procedures  for  the  entropy  function 
associated  with  the  binomial  populations.  Sankhya  Ser.  A  38,  153-173. 

Gupta,  S.  S.,  Huang,  D.  Y.  and  Huang,  W.  T.  (1976).  On  ranking  and  selection  procedures 
and  tests  of  homogeneity  for  binomial  populations.  Essays  in  Probability  and  Statistics 
(Eds.  S.  Ikeda  et.  al.),  Shinko  Tsusho  Co.  Ltd.  Tokyo,  Japan  501-533. 

Gupta,  S.  S.  and  Liang,  T.  (1987).  Empirical  Bayes  rules  for  selecting  the  best  binomial 
populations.  Statistical  Decision  Theory  and  Related  Topics-IV  (Eds.  S.  S.  Gupta  and 
J.  O.  Berger),  Springer- Verlag,  Vol.  I,  213-224. 

Gupta,  S.  S.  and  McDonald,  G.  C.  (1986).  A  statistical  selection  approach  to  binomial 
models.  Journal  of  Quality  Technology ,  18,  103-115. 

Gupta,  S.  S.  and  Sobel,  M.  (1960).  Selecting  a  subset  containing  the  best  of  several 
binomial  populations.  Contributions  to  Probability  and  Statistics  (Eds.  I.  Olkin  et. 
al.),  Stanford  University  Press,  California,  224-248. 

Hoeffding,  W.  (1963).  Probabilities  for  sums  of  bounded  random  variables.  J.  Amer. 
Statist.  Assoc.  58,  13-30. 

Kulkarni,  R.  V.  and  Jennison,  C.  (1986).  Optimal  properties  of  the  Bechhofer-Kulkarni 
Bernoulli  selection  procedures.  Ann.  Statist .,  14,  298-314. 

Robbins,  H.  (1956).  An  empirical  Bayes  approach  to  statistics.  Proc.  Third  Berkeley 
Symp.  Math.  Statist.  Probab.,  1,  University  of  California  Press,  157-163. 

Robbins,  H.  (1964).  The  empirical  Bayes  approach  to  statistical  decision  problems.  Ann. 


Math.  Statist. ,  35,  1-20. 


Robbins,  H.  (1983).  Some  thoughts  on  empirical  Bayes  estimation.  i4rm.  Statist.  11, 
713-723. 

Sobel,  M.  and  Huyett,  M.  J.  (1957).  Selecting  the  best  one  of  several  binomial  populations. 
Bell  System  Tech.  J.,  36,  537-576. 


la.  REPORT  SECURITY  CLASSIFICATION 

UNCLASSIFIED 


3a.  SECURITY  CLASSIFICATION  AUTHORITY 


DECLASSIFICATION  /DOWNGRADING  SCHEDULE 


REPORT  DOCUMENTATION  PAGE 


lb  RESTRICTIVE  MARKINGS 


3  DISTRIBUTION /AVAILABILITY  OF  REPORT 


4.  PERFORMING  ORGANIZATION  REPORT  NUMBER(S) 


Technical  Report  #88-1 


6a.  NAME  OF  PERFORMING  ORGANIZATION 

Purdue  University 


6c  ADDRESS  {City.  State,  and  ZIP  Cod') 

Department  of  Statistics 
West  Lafayette,  IN  47907 


6b  OFFICE  SYMBOL 
(If  applk able) 


S.  MONITORING  ORGANIZATION  REPORT  NUMBER(S) 


7a  NAME  OF  MONITORING  ORGANIZATION 


7b  ADDRESS  (C/iy.  State,  and  ZIP  Cod') 


8a.  NAME  OF  FUNDING /SPONSORING 
ORGANIZATION 

Office  of  Naval  Research 


8c  ADDRESS  (City.  State,  and  ZIP  Cod') 

Arlington,  VA  22217-5000 


8b.  OFFICE  SYMBOL 
(If  applicable) 


9.  PROCUREMENT  INSTRUMENT  IDENTIFICATION  NUMBER 

N00014-84-C-0167, 

N00014-88-K-0170  and  NSF  Grant  DMS-860694 


10.  SOURCE  OF  FUNDING  NUMBERS 


PROGRAM 
ELEMENT  NO. 


PROJECT 

NO. 


WORK  UNIT 
ACCESSION  NO 


11  TITLE  (Includt  Security  Classification) 

Selecting  the  Best  Binomial  Population:  Parametric  Empirical  Bayes  Approach 


12  PERSONAL  AUTHOR(S) 

Shanti  S.  Gupta  and  TaChen  Lian 


13a  TYPE  OF  REPORT 

Technical 


13b.  TIME  COVERED 
FROM  TO 


14.  DATE  OF  REPORT  (Y"r,  Month.  Osy)  hs  PACE  COUNT 

February  1,  1988  I  15 


COSATI  CODES 


GROUP  SUBGROUP 


18  SUBJECT  TERMS  (Continue  on  reverse  if  necessary  and  identify  by  block  number) 

Asymptotically  optimal,  Bayes  rules,  empirical  Bayes  rules, 
best  population,  binomial-beta  model,  rate  of  convergence. 


19  ABSTRACT  ( Continue  on  reverse  if  necessary  and  identify  by  block  number) 

Consider  k  populations  , . , . ,  tt^,  where  an  observation  from  population  tk  has  a  binomial  dis¬ 
tribution  with  parameters  N  and  p.  (unknown).  Let  Dr.-«  =  max  p^.  A  population  tk  with 

1  LKJ  1  <j  <k 


A  population  tk  with 


Pi  ~  P[k]  ca^e<^  a  best  Population.  We  are  interested  in  selecting  the  best  population. 

Let  p  =  ( Pi , . . .  ,p(< )  and  let  a  denote  the  index  of  the  selected  population.  Under  the  loss 

function  L(p,  a)  =  pj-^  -  pfl,  this  statistical  selection  problem  is  studied  via  a  parametric 

empirical  Bayes  approach.  It  is  assumed  that  the  binomial  parameters  p^,  i  =  l,...,k,  follow 

some  conjugate  beta  prior  distributions  with  unknown  hyperparameters.  Under  the  binomial-beta 
statistical  framework,  an  empirical  selection  rule  is  proposed.  It  is  shown  that  the  Bayes 
risk  of  the  proposed  empirical  Bayes  selection  rule  converges  to  the  corresponding  minimum 
Bayes  risk  with  rates  of  convergence  at  least  of  order  0(exp(-cn))  for  some  positive  constant 


20  DISTRIBUTION /AVAILABILITY  OF  ABSTRACT  21  ABSTRACT  SECURITY  CLASSIFICATION 

CDunclassifiedajnlimiteo  □  SAME  AS  RPT  □  DTIC  USERS  UNCLASSIFIED 


22*  NAME  OF  RESPONSIBLE  INDIVIDUAL  |22b  TELEPHONE  (Include  Area  Code)  22c  OFFICE  SYMBOL 

nti  S.  Gupta  _  I  317-494-6031 


OD  FORM  1473. 84  MAR  83  APR  edition  may  t*u,edunt, I  exhausted  security  ClaSS.F.CATION  of  -his  page 

All  other  editions  are  obiolete.  - - - — —  -- 


jJjrl 


19.  c,  where  n  is  the  number  of  accumulated  past  experience  (observations)  at  hand. 


s 


